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ABSTRACT 


This research aims to manage the risks of spreading Corona-Virus over the world, by specifying the optimal statistical 
modeling to analyze the daily count of new cases of the COVID-19, therefore discrete distributions were needed. A new 
three-parameter discrete distribution has been improved named as a Discrete Marshall—Olkin Lomax (DAPIL) 
distribution. Probability mass function and hazard rate are discussed. Point estimation and confidence interval by using 
maximum likelihood estimation (MLE) for the DAPIL distribution parameters are discussed. A numerical study is done 
using the daily count of new cases in Australia. Monte Carlo Simulation has been performed to evaluate the restricted 


sample properties of the proposed distribution. 
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INTRODUCTION 





In December 2019, Corona-Virus "COVID-19" was started in Wuhan, China. On March 11, 2020, World Health 
Organization (WHO) described COVID-19 as a pandemic. See Figure | and 2. Therefore, countries around the world have 


been increased their measures trying to decrease the spread rate of the COVID-19. 





Americas 16,743,243 
ee net 
South-East Asia 7,161,969 
aes cae 
Europe 5,937,969 
ay confirmed 
Eastern Mediterranean 2,430,239 
== confirmed 
Africa 1,187,269 
= confirmed 
Western Pacific 618,112 
Fa confirmed eli 
Source: World Health Organization 


Jan 31 Feb 29 Mar 31 Apr 30 May 31 Jun 30 Jul 31 Aug31 Sep 30 
Data may be incomplete for the current day or week 


Figure 1: The Situation for the Daily New Cases Over the World by the WHO Region. 
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Figure 2: The Situation for the Daily New Deaths Over the World by the WHO Region. 


To model daily cases and deaths in the world, a natural discrete Lindley distribution has been introduced by AI- 
Babtain et al. (2020). A discrete Marshall-Olkin generalized exponential distribution has been introduced by Almetwally et 
al. (2020) to discuss the daily new cases of Egypt. The study was carried out by Hasab et al. (2020) where they used the 
Susceptible Infected Recovered (SIR) epidemic dynamics of the COVID-19 pandemic for modeling of the novel 
Coronavirus epidemic in Egypt. El-Morshedy et al. (2020) studied a new discrete distribution, called discrete generalized 
Lindley, to analyze the counts of the daily coronavirus cases in Hong Kong and daily new deaths in Iran. An autoregressive 
time series model based on the two-piece scale mixture normal distribution has been used by Maleki et al. (2020) to 
forecast the recovered and confirmed COVID-19 cases. The daily new COVID-19 cases in China have been predicted by 
Nesteruk (2020) and Batista (2020b) by using the mathematical model, called susceptible, infected, and recovered (SIR). 
Batista (2020a) used the logistic growth regression model is used for the estimation of the final size and its peak time of the 


coronavirus epidemic. 


In inverse Lomax (IL) is originally developed as a lifetime distribution. The IL is member of family of 
generalized beta distribution. Kleiber and Kotz (2003) showed that IL distribution can be used in economics. The IL 
distribution has many applications in modeling of different trends of hazard rate function (hrf), i.e., decreasing or upside- 
down bathtub failure rate of life testing of components. Many authors have studied more applications by using extended IL 
distribution. Recently, ZeinEldin et al. (2020) studied and introduced the alpha power extended IL (APIL) distribution. The 


CDF and PDF of APIL distribution with parameters o,6 and $ are given respectively, as 


(2-8) ° i 
Gann —@— > s>@a00s0a+1 
Fixia@,t, et r e W @) Oy Py (1.1) 
and 
; Ina) od BF 4.7" 
fGra.#,6)=——~-(1 +=) atts) a2 0, (1.3) 
a—-lx* x. 


Discrete Burr type XII and discrete Lomax distributions have been introduced by Para and Jan (2016). Discrete 


Lomax (DL) distribution is helpful in modeling discrete data which exhibits heavy tails and can be useful in medical 


science and other fields. The PMF and CDF of DL distribution with parameters e and $ are given respectively, as 
faz zit) 
P(x; P.o) = pla) _ pm"), = Np. @>00<P <1, (1.4) 


F(x;P,8) = 1- pils+3), (1.5) 
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The question may come to mind of any researcher: why do we need discrete distributions? Since In count data 
analysis, we see the most of the existing continuous distributions do not set suitable results for modeling the COVID-19 


cases. The cause for this as we know that counts of deaths or daily new cases show excessive dispersion. 


In order to ensure members of the Australia society from the risks arising from the spread of Corona-Virus in 
Australia, this study aims to model the daily new cases and deaths of the COVID-19 employing a new statistical tool. An 
aspect of the importance of research is the necessity of mathematical and statistical modeling of the extent and spread of 
the Coronavirus. The discrete alpha power inverse Lomax distribution will be introduced that can be denoted as DAPIL 


distribution. 


The rest of the paper is organized as follows. In Section 2, the survival discretization method is used. Discrete 
alpha power inverse Lomax distribution and maximum likelihood estimation of DAPIL are introduced in Section 3. 
Section 4 deals with Simulation study by using bias and MSE to inference MLE estimator. Daily new cases of COVID-19 
in the case of Australia are used to validate the use of models in fitting lifetime count data are presented in Section 5. 


Finally, conclusions are provided in Section 6. 
SURVIVAL DISCRETIZATION METHOD 


In the statistics literature, sundry methods are available to obtain a discrete distribution from a continuous one. The most 
commonly used technique to generate discrete distribution is called a survival discretization method, it requires the 
existence of cumulative distribution function (CDF), survival function should be continuous and non-negative and times 


are divided into unit intervals. The PMF of discrete distribution is defined in Roy (2003, 2004) as follows 
POX =x) =P 3 X¥gx4+1) = Sx) —S@& +1);x =0,1,2,.., (2.1) 


Where S(x} = PC = x) = Flax: ), where F(x: is a CDF of continuous distribution and * is a vector of 


parameters. The random variable X is said to have the discrete distribution if its CDF is given by 


P(X <x} = FG +1; 9). 





Pt x=a) 
The hazard rate is given by Ar(x) = a The reversed failure rate of discrete distribution is given as 
Fixsal 
wfrlx) — aa 


DISCRETE ALPHA POWER INVERSE LOMAX DISTRIBUTION 


In this Section, we introduce a new flexible discrete model, can be donated as discrete alpha power inverse Lomax 


(DAPIL) distribution. Parameter estimation of DAPIL distribution is discussed by using MLE. 
The DAPIL Distribution 


The continuous APIL distribution is introduced by ZeinEldin et al. (2020). The survival function, of the APIL distribution, 
is given by 
_ gat 


—1 


a 
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Using the survival discretization method and survival function of APIL distribution, we define the PMF of the 


DAPIL distribution as given below 


atte) _ g(t” 
P(x; a,0.4) a ane = 0,1,2,... (3.1) 
— 6 
Let? = * then” —e = lithe PME can be rewritten as following 
a ed 7 on 1-5} 
Poe e y] — A, 
f a rs Pe a = 1 a (3.2) 


Figure 3 shows the PMF plots for different values of the model parameters. From Figure 3, the PMF of the 
DAPIL distribution is unimodal and right-skewed. 
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Figure 3: The PMF Plots of the DAPIL Distribution. 


The CDF of the DAPIL distribution is given as below 


amten$s) 
F(x:a,8.p) = rT 


Where HM, = {0,1,2....#. Survival function of the DAPIL distribution is given by 


— 
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The Ar function of the DAPIL distribution is given by 
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By using Eq. (3.3), Figure 4 shows the HRF plots of the DAPIL distribution. 
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(3.3) 


It is noted that the shape of the HRF 
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Figure 4: 


Parameter Estimation 
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The HRF Plots of the DAPIL Distribution. 


The unknown parameters of the DAPIL distribution are obtained by the maximum likelihood estimation (MLE) method. 


This method is based on the maximization of the log-likelihood for a given data set, assume that # = (ry.%g0.0.%,7 is a 


random sample of size n from a DAPIL (¥. F. #) distribution. By using Eq. (3.2), the log-likelihood function becomes 
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The estimate of the parameter by using MLE can be obtained by a numerical analysis such as the Newton— 
Raphson algorithm. 


Asymptotic Confidence Intervals 


In this subsection, we propose the asymptotic confidence intervals (CI) using MLE method to construct the confidence 


intervals for the parameters. First, we obtain! (é&, #,) which is the observed inverse Fishers information matrix and it is 


defined as: 
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See second derivatives of the logarithm of the likelihood function as follows: 
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Suppose the asymptotic variance-covariance matrix of the parameters &, #, 2 is 


V(a@aA=ri(ada)=llee Tea | 
For interval estimation, a 100(1 — +)% confidence interval for parameter &, #,9 can be constructed based on 


the asymptotic normality of the MLE. 
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SIMULATION STUDY 


A simulation study is assumed to evaluate and compare the behavior of the estimates with respect to their bias and mean 
square error (MSE). We generate 10000 random sample = ¥4,.¥3,..., Of sizes, n = 50, 100, 150 and 200 from DAPIL 


distribution. Different sets of parameters are obtained. 


The MLE of and _ are computed. Then, the bias and MSE of the estimates of the unknown parameters are 


computed. Simulated outcomes are listed in Tables 1-2 and the following observations are detected. 
e The bias and MSE decrease as sample sizes increase for all estimates (see Tables 1-2). 
e =©The bias and MSE of MLE for _ @, festimate is increasing with increased and and fixed , 


¢ The bias and MSE of MLE for #, @ estimates are decreasing with increased @ and fixed 9, &. 
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Table 1: Bias, MSE and CI of Parameters of DAPIL Distribution 
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APPLICATION ANALYSIS 


In this section, the DAPITL distribution is fitted to more famous fields of survival times of Covid-19 in Australia. We 
compare the fits of the discrete Marshall-Olkin generalized exponential (DMOGEx) [Almetwally et al. (2020)] model, 
discrete generalized exponential (DGEx) [Nekoukhou et al. (2013)], natural discrete Lindley (NDL) [Al-Babtain et al. 
(2020)], discrete Gompertz exponential (DGzEx) [El-Morshedy et al. (2020)], discrete Burr (DB) [Krishna and Pundir 
(2009)], discrete Lindley (DLi) [G6mez-Déniz and Calderin-Ojeda (2011)], discrete Lomax (DLo) [Para and Jan (2016)] 


and Geometric models in Tables 3. 


Table 3, provide values of AIC, CAIC, BIC, HQIC and Kolmogorov- Smirnov (KS) statistic along with its P- 
value for the all models fitted based on real data set of Australia. In addition, this table contains the MLE and standard 
errors (SE) of the parameters for the considered models. The fitted DAPIL, PMF, CDF, PP-plot and QQ-plot of the data 
sets of Australia are displayed in Figure 5. These figures indicate that the DAPIL distribution get the lowest values of AIC, 
CAIC, BIC, HQIC, KS and largest P-value, among all fitted models. 


This is a COVID-19 data belong to Australia of 32 days, that is recorded from 3September to 4October 2020. This 
data formed of daily new cases. The data are as follows:6, 15, 59,11, 5, 9, 8,11, 7, 9,6,7, 6, 0, 8, 8, 5, 7,5, 2, 3,5, 2, 8, 1, 2, 
3, 7,4, 2, 2, 3. 


Table 3: ML Estimates, K-S, P-Values, AIC, CAIC, BIC and HQIC for COVID-19 Data in Australia Data 





















































Estimate SE KS P-Value AIC CAIC BIC HQIC 

@ | 0.0030 0.0039 

DAPILo | #@ | 2.9569 1.4151 | 0.1533 | 0.4393 190.4229 | 191.2801 | 194.8201 | 191.8805 
P| 0.0074 0.0121 

DB =e oe 0.4080 | 0.0000 | 218.2952 | 218.7090 | 221.2267 | 219.2669 
Be | 0.8731 0.0781 

Dii P| 0.7943 0.0231 | 0.1620 | 0.3704 | 196.2441 | 196.3774 | 197.7098 | 196.7300 
a | 0.8287 0.0332 

DGE 3 18918 0.545] 0.1442 | 0.5190 | 195.3791 | 195.7928 | 198.3105 | 196.3508 
a@ |) 2.6244 0.4757 

DMOGE | # | 0.0075 0.0089 | 0.1885 | 0.2056 | 190.8208 | 191.6780 | 195.2180 | 192.2784 
P| 0.0294 0.0190 

Geom P|} 0.1194 0.0198 | 0.2546 | 0.0315 198.0333 | 198.1666 | 199.4990 | 198.5192 
@ | 28.3501 | 16.2203 
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Impact Factor (JCC): 5.3784 NAAS Rating 3.45 


Discrete Alpha Power Inverse Lomax Distribution with Application of COVID-19 Data 59 





























cs 
7 co) 
<o = 
= 
< PI 
= cs _ s 
= = = 
= = = 
co = 
Ss 
~ Ss 
= 
o s 
= s t T T T T T 1 
oO 10 20 30 40 50 60 
x x 
Q-Q plot for DAPIL 
S S = 
S 2 
s 
Ss 
= o 
=> = > 
> SS 
S = = 
os Ss 
a 
2 NN 
= aoe Ss 
[—J 
o > 
T T T T T T T > T T T T T 
° 5 10 15 20 25 30 35 0.0 0.2 o4 06 os 1.0 
quantile(x) probability(x) 


Figure 5: Estimated PMF, CDF, PP-Plot and QQ-Plot of DAPIL for COVID-19 data in Australia Data. 


CONCLUDING REMARKS 


In this article, with the aim of managing the risk of spreading Coronavirus in Australia, we proposed and studied the 


discrete Marshall—Olkin Lomax distribution. The maximum likelihood estimation method is discussed to estimate the 


parameter of DAPIL distribution. Monte Carlo Simulations are obtained to evaluate the restricted sample properties of the 


DAPIL distribution. We prove empirically that the DAPIL model reveals its superiority over other competitive models as 


Marshall-Olkin generalized exponential, discrete generalized exponential, natural discrete Lindley, discrete Gompertz 


exponential, discrete Burr, discrete Lindley, discrete Lomax and Geometric for analysis daily new cases of the COVID-19 


in the case of Australia. These figures indicate that the DAPIL distribution gets the lowest values of AIC, CAIC, BIC, 


HQIC, KS, and the largest P-value, among all fitted models. 
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